A Tour to Spur for 
Non-VM Experts 


Guille Polito, Christophe Demarey 
ESUG 2016, 22/08, Praha 



RIStAL 


Centre de Recherche en Informatique, 
Signal et Automatique de Lille 



INUENTEUR5 DU MONDE NUM^RIQUE 



From a user point of view 

We are working on the new Pharo Kernel 

• Bootstrap: create an image from scratch 

- Classes 

- Global objects 

- Processes and contexts 

• Image Initialization: What is the correct order? 

BTW, see our talk on this ;) 

Mission Pharo Kernel, Thursday 10 am 



What is this talk about? 





Dec 14, 2015; 11:08am 

[IMPORTANT] Starting migration to Spur 
VM 





Initial Motivation to look into Spur 



1) How do we move to Spur 

as fast as possible? 


2) Should we / 

How do we adapt to it? 






What are the risks? 


Motivation of this talk #1: Education 


Explain what is Spur 

Determine if a problem comes 
from image side or VM side? 



Motivation of this talk #2: 
Understanding the Impact 

Is my application compatible? 

Will It break? Do I have to port it? 


Part 1: 

Demolishing Myths 


What is Spur? 



spur is not a new Virtual Machine 

Its underlying execution engine is the same as in Cog 
(same bytecode, same interpreter, same JIT compiler) 



spur is not a new Garbage Collector 

It just implements a new garbage collector 
(which, BTW, is not new...) 



spur is not a new Object Format. 

It just implements a new object format 
(which, BTW, is just the means to an end) 



So... what is Spur? 


Spur is a new Memory Manager for Cog VM. 

- New object representation in memory 

(that allows ephemerons, pinned objects,...) 

- New memory organization of Pharo images 

(that allows to better manage resources) 






Spur in a Nutshell 


It's a Cog VM 

+ 64 bits support 
+ faster: xl.8 speedup 
+ larger images (> 2 Go) 
+ ephemeron support 
and more ... 



Spur > 64-bits support 


No more need to install 32-bits libraries 


sudo dpkg — add-architecture 1386 

sudo apt-get update 

sudo apt-get install libxll-6 : 1386 

sudo apt-get install libgll-mesa-glx: i386 

sudo apt-get install libfontconf igl: i386 

sudo apt-get install libssll* 0. 0: i386 


Images with size > 2 Go 



Spur > faster access to classes 

Class Table 

• Direct access to 
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Spur > faster 
Garbage Collector 

• “Young objects die young (and fast)” 

• Added survivor segments (future and past) to 
the young space 

=> allows more minor GC instead of major GC 

Minor GC 

Major GC (mark & sweep) (scavenger) 



future 

past 

CD 

Q. 

CD 
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Old space 


Young Space 


Spur > Fast become 


No more hangs in large images when using #become: 

(e.g. Moose with a big famix model) 

Why? Spur introduces forwarders 

• prevents to scan the whole memory 

• replaces pointers when they are accessed 

• implemented by a partial read barrier 1 

Cheap in most cases (just one indirection) 

Costly if you rely a lot on primitives fallback 

1. Eliot Miranda, Clement Bera. A Partial Read Barrier for Efficient Support 
of Live Object-Oriented Programming. ISMM'15 



Spur > faster 
Immediate objects 


New immediate objects 

• Character 

• Float (only 64-bits) 


Binary representation of object pointers ; x is 1 or 0 
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First bits are 000; this is a direct pointer to an object in the heap 
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First bit is 1 ; this is a Small Integer instance (63 bits signed int) 
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First bits are 010: this is a Character instance 
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First bits are 100; this is a Small Float instance 


Speed-up in wide strings 

Speed-up in float arithmetic and memory saving 


Spur > other features 


Spur object format: 
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number of slots 
identity hash 
is grey ? 
unused bits 


f 

object format 

z 

is remembered ? 

c 

class index 

n 

is marked ? 

& 

is pinned ? 

i 

is read only ? 


• All classes are compact 

=> only two kind of headers (3 before Spur) 

• Support for pinned-objects (see UFFI talk on Friday) 

• Ongoing support of read-only objects 

• Still 2 unused bits 


Spur > scalability 


• Memory is now divided in several segments 

• No more need to have a contiguous chunk of 
memory 



Memory 

segment 


Memory 

segment 


Memory 

segment 


Memory 

segment 




Spur > reliability 


• Ephemeron finalization support 

• Avoid memory leaks 


BTW, see our OTHER talk on this ;) 

A Weak Pharo story, Thursday, 3 pm 



Part 2: 

Porting applications and 
frameworks to Spur 




How do I port 
my application 

to Spur? 



Porting Applications 


Q NOTHING TO 

DO HERE 

c? r * 




Porting Applications 




Porting Applications 


Okay, maybe just wait that your developer friends 

port your favorite frameworks. 



Porting Frameworks/Libraries 




Porting Frameworks/Libraries 


Porting Basics #1 

The number hierarchy changed 



• Beware if you have visitors 

• Beware if you have overrides 





Porting Frameworks/Libraries 

Porting Basics #2 

Character is now immediate 






Beware if you have overrides that use the 
internal state 




Porting Frameworks/Libraries 


Porting Basics #3 

New (enhanced) ephemeron finalization 



Epln &r> wo mR 



• If you need finalization you'll probably want to 
use the new one 

BTW, see our OTHER talk on this ;) 

A Weak Pharo story, Thursday, 3 pm 





Porting Frameworks/Libraries 


Porting Basics #4 

Native Boost is being deprecated 

Native Boost ► UFFI 


If you are using FFI, you will need to review your 
bindings 







Spur Behind the Scenes 



VM development hosted on GitHub 
OpenSmalltalk / opensmalltalk-vm 


ED 


A GitHub, Inc. 


a o ^ 


O Personal Open source Business Explore Pricing Blog Support This repository 


Sign in 


B OpenSmalltalk / opensmalltalk-vm 


<*> Watch 10 ’fr Star 36 ^Fork 9 


OCode Issues 8 Pull requests c 4*- Pulse Graphs 


Cross-platform virtual machine for Squeak, Phare, Cuis, and Newspeak. 


©1,770 commits 


10 branches 


^>1 release 


1116 contributors 


Branch: Cog ▼ New puli request 


is fniephaus Update link with info on how to use "ht" vms 


I . glt.fi Iters 
I build. linux32ARMv6 
I build Jnux32ARMv7 
I build. Iinux32x86 
I build Jnux64x64 
) build.macos32x86 
i build. macos64x64 
I build.win32x86 
i build.win64x64 
I image 


Remove superstitious code [skip ci] 

Force remove conffg.h in mvm scripts 

Force remove config.h in mvm scripts 

Force remove config.h in mvm scripts 

Force remove config.h in mvm scripts 

Enable automatic graphics card switching on macOS 

Make scripts fail-stop. 

Don’t use the provided 3rd party DirectX include files 
Don’t use the provided 3rd party DirectX include files 
Make scripts fail-stop. 


Find file 

Latest commit &6292S1 3 days ago 

2 months ago 
12 days ago 
12 days ago 
12 days ago 
12 days ago 
16 days ago 
21 days ago 
19 days ago 
19 days ago 
21 days ago 


Why is it a good news? 

• Brings together the VM community 

• Easier to contribute 

- Pull requests 

- Issue tracker 

- Documentation: 
https://github.com/OpenSmalltalk/opensmalltalk- 
vm/blob/Cog/CONTRIBUTING.md 



VM build all flavors through 

Travis Cl 
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Still missing VM tests. Upcoming? 


Where to find VM binaries? 

• Pharo J Virtual Machines 

http://files.pharo.org/vm/ 


*1* JFrogBintray 


Squeak, NewSpeak 

I 1 I 01 % I E+- II / / 

yJJ opensmalltatk/vm / cog 



Please see the package readme for further information. 


General 

Readme 

Release Notes Reviews fO) 

Statistics 

Files 

About This Package 

Version Notification Links <§> 

Versions 


Website 

https ://gi th u b.co m/0 pen § mal Ita I k/vm 

2G1 6081 41 01 4 


Issue Tracker 

h ttps://gi th u b . co m/G pe n 5 ma I Ita 1 k/v-m /I ss u es 

201606301459 








Conclusion 



Should I move to Spur? 


1,8 


1,6 



BubbleSort Fannkuch Json Nbody Permute Quicksort Storage 

Bounce DeltaBlue GraphSearch Mandelbrot PageRank Queens Sieve Towers 


■ Pre-Spur 

■ Spur 


by Stefan Marr, Apr 06, 2015 



Should I move to Spur? 


64-bits support 

Increased performances: xl.8 speedup 

Scalability, Reliability and open to new 
features 

image not compacting anymore (will be fixed soon) 



EXTRA SLIDES 



Dissecting Spur... 



i l) Class tables 

2) Forwarders 

3) Ephemeron Finalization 

4) The Scavenger GC 






Chapter 1 

Classes are in Tables 
(and they hide in tables) 




1.1 The old object header... 



1.1 Compact classes 


Header type 3 

Compact Class 


Header 

S 

lot 

1 



Smalltalk compactC I asses Array 



ObjectMemory»#baseHeader; 


■ lyp* ■ pet ■ 

Garbage Collcttar (^kHarked;) 

Mart : used lo mark objects during mark phase 
Root : set if object is a root object. 

- the third bit is unused 


Compact Class index. (#compactO ass Index Of]) 

non-zero value if the class of the object pointed is a compact, 
represent the index in the compa-aClass array 

Object Size (ftslicBJtsOf) 

The object size in 32-hit machine words. 

For objects with size which don‘t fit in this fidd. we p*it all 
ones. 

and store object size in separate 32-bit word of object 
header (header type == -0} 


Objoct Hash (#hashBits€tf: ) 

12 bits wich some random walge. assigned at object 
creation. See didentityH-ash 


Object Format (#formatOf:) 

I 6 possible values describing the type of dhe object. 
An exhaustive list m a dedicated slide 

Header Type {#headerType:) 

C 3 '-Vards Objects |Srce and Qau;) 

I 2 v*ifcf4s Objects (ewaiy dw, whith rwt compact das*J 
2 : Wbnd free 

3 I YSferk ofejnct <u5^lly {impact class) 




1.1 Cons of the old object header 


• A full word is used to indicate an object's class 

- 4G classes in 32 bits 

- 16E (2 A 60) classes in 64 bits (!!) 

• Three different headers 

=> checks for the header type are common 



1.2 New class header 


Spur's object header 
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number of slots 


Q] object format 



remaining bits 


|~h| identity hash [c] class index 




1.2 Class table 
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1.2 Class table 




1.2 Pros of the new object header 


• 2 A 22 classes (4M). Still enough and efficient. 

• Compatible with 64bits 

• All classes are compact => only two kind of headers 



1.2 Hidden objects 




1.2 Hidden objects? 


• The class table is an object (and lives in the heap) 

• Its class index is “hidden”: 

- Array allinstances 

will iterate objects by class index 

• In the class table: 

- Indexes 0-15 are reserved for tagged objects 

- Indexes 16-32 are reserved for hidden classes 



1.3 Maintaining the class table 


Classes are normal objects... 

They are created with no special primitives... 

But... 

How does the VM know an object is a 
class to put it into the class table? 



1.3 Identifying classes by definition 


A class is an object that is instantiated: 


A class enters the class table 
upon instantiation 



1.3 But the index is the hash! 


But... hashes are assigned lazily for all objects: 

Classes, on instance-side, 
define a special hash method 

Behavior » basicldentityHash Object » basicldentityHash 


<primitive: 175> 
self primitiveFailed 


<primitive: 75> 
self primitiveFailed 



Chapter 1 - Conclusions 


• Classes are organized in tables 

• All classes are compact 

• Simpler object header 

• Still place for 4M classes 

• On the image side, is almost transparent 



Chapter 2 

The forwarder plague 


2.1 Become 


• Swaps two objects 

- (actually, swaps two object's identity) 

• Useful for: 


- Updating and migrating objects 

- Install proxies 

- Replace an object's behavior 



2.1 The old become 


• Full scanned all memory 

• And was SLOOOOOW 



2.1 Lazy become 



Eliot Miranda, Clement Bera. A Partial Read Barrier for Efficient 
Support of Live Object-Oriented Programming. ISMM'15 







2.1 Lazy become 



Eliot Miranda, Clement Bera. A Partial Read Barrier for Efficient 
Support of Live Object-Oriented Programming. ISMM'15 







2.1 Lazy become 



Eliot Miranda, Clement Bera. A Partial Read Barrier for Efficient 
Support of Live Object-Oriented Programming. ISMM'15 






2.1 Lazy Become 



b doSomething 


Eliot Miranda, Clement Bera. A Partial Read Barrier for Efficient 
Support of Live Object-Oriented Programming. ISMM'15 





2.2 The read barrier 


• A full read barrier would be too expensive 

- (on every read, on every primitive, on every 
message send...) 


• The read barrier is implemented in two places: 

- Message send lookup failure 

- Primitive failure 


Eliot Miranda, Clement Bera. A Partial Read Barrier for Efficient 
Support of Live Object-Oriented Programming. ISMM'15 



2.2 Message send lookup failure 


method := (self lookupSelector: selector inClass: class), 
method ifNil: [ 

(receiver isForwarder) ifTrue: [ 

receiver := receiver forward. 

“scan also the objects in the stack” ]. 

method := (self lookupSelector: selector inClass: class). 

]■ 


Eliot Miranda, Clement Bera. A Partial Read Barrier for Efficient 
Support of Live Object-Oriented Programming. ISMM'15 



2.2 Primitive failure 


self performPrimitive: primitiveNumber. 
self primitiveFailed ifTrue: [ 

“scan the stack looking for forwarders and retry” 
self performPrimitive: primitiveNumber. 


Eliot Miranda, Clement Bera. A Partial Read Barrier for Efficient 
Support of Live Object-Oriented Programming. ISMM'15 



2 Conclusions 


• Become does not need full scan anymore 

• A forwarder replaces the object in place 

• Two-way become copies object at the end 

• Forwarders are bypassed using a partial read barrier: 

- Message lookup failure 

- Primitive failure 

• No noticeable overhead 



3.5 Scavenger GC 


• “Young Objects Die Young (and quick)” 

• Young objects are created in eden 

• Objects are “tenured” after surviving several 
generations 

• Tenured objects go to old space 



New Space 


3.5 Scavenger GC 


Old space 



future 

past 

CD 

Q. 

CD 
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3.5 Scavenger GC 

Old space 


• Mark and Sweep (marking collector) 

• Runs “every blue moon” on the entire memory 


Slow 


New Space 


CD 

future past cd 

• Scavenger (copying collector) 

• Runs often, only in new space 

• Object tenure (to old space) 
depends on the ratio of 


allocation 








3.5 Scavenger GC 

New Space 


future past 


1) Future is always empty during execution 









3.5 Scavenger GC 

New Space 


future past eden 


1) Future is always empty during execution 

2) On a GC, past and eden objects that are referenced are copied to future 









3.5 Scavenger GC 

New Space 

past future eden 


1) Future is always empty during execution 

2) On a GC, past and eden objects that are referenced are copied to future 

3) Then, future and past spaces are swapped 









3.5 Scavenger GC 

Two questions remain: 

• How does the scavenger do a GC without 
iterating the entire heap? 

• How does he know object ages? 



3.5 Scavenger GC 

Two questions remain: 


• How does the scavenger do a GC without 
iterating the entire heap? 

It maintains a set of “objects in new space referenced from old space” 

• How does he know object ages? 


By their addresses! Lower addresses are younger.... 


Is that all? 


Pinned objects? 

The finalization queue? 

Memory segments, bridges, ...? 

(The not working) Memory compaction? 
New immediate objects? 



